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Abstract — In this paper, we propose a novel adaptive reduced- 
rank strategy for very large multiuser multi-input multi-output 
(MIMO) systems. The proposed reduced-rank scheme is based 
on the concept of joint iterative optimization (JIO) of filters 
according to the minimization of the bit error rate (BER) 
cost function. The proposed optimization technique adjusts the 
weights of a projection matrix and a reduced-rank filter jointly. 
We develop stochastic gradient (SG) algorithms for their adaptive 
implementation and introduce a novel automatic rank selection 
method based on the BER criterion. Simulation results for 
multiuser MIMO systems show that the proposed adaptive 
algorithms significantly outperform existing schemes. 
□ 

Index Terms- Multiuser MIMO systems, massive MIMO, 
reduced-rank methods, adaptive algorithms, BER cost function. 

I. Introduction 

Wireless communication research has recently focused on 
multi-input multi-output (MIMO) systems in order to exploit 
the increased capacity offered by the use of multiple an- 
tennas, and improve the quality and reliability of wireless 
links Qj. In MIMO systems, two configurations can be 
employed, namely, diversity and spatial multiplexing, which 
exploit spatial diversity to combat fading and increase the data 
rates by transmitting independent data streams, respectively. 
In particular, spatial multiplexing can be used for multiuser 
MIMO systems to transmit multiple data streams that can be 
separated using signal processing techniques at the receiver. 
More recently, multiuser detection has been considered in 
conjunction with MIMO techniques, which is widely believed 
to play an important role in future communication systems Q, 
13, Q. A recent trend has been introduced with the concept 
of massive MIMO (6| and the investigation of algorithms 
[for very large MIMO systems Q, El, which present key 
technical challenges for designers. Central problems in very 
large multiuser MIMO systems are the tasks of detection 
and parameter estimation that are required for interference 
suppression and must deal with a large number of parameters. 

In this context, reduced-rank signal processing is a very 
promising technique due to its ability to deal with a large 
number of parameters. It has received significant attention in 
the past several years, since it provides faster convergence 
speed, better tracking performance and an increased robustness 
against interference as compared to conventional schemes 
operating with a large number of parameters. A number of 
reduced-rank techniques have been developed to design the 
subspace projection matrix and the reduced-rank filter []9]- ifTTll . 
Among the first schemes are eigendecomposition-based (EIG) 
algorithms (9), (TO). The multistage Wiener filter (MWF) has 
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been investigated in [ 1 1 1 and [12], whereas the auxiliary vector 
filtering (AVF) algorithm has been considered in ifPTl . EIG, 
MWF and AVF have faster convergence speed with a much 
smaller filter size, but their computational complexity is very 
high. A strategy based on the joint and iterative optimization 
(JIO) of a subspace projection matrix and a reduced-rank 
filter has been reported in 1(141 . lfj"5l . whereas algorithms with 
switching mechanisms have been considered in [17| for DS- 
CDMA systems. 

Most of the contributions to date are either based on the 
minimization of the mean square error (MSE) and/or the 
minimum variance criteria [9|-[17|, which are not the most 
appropriate metric from a performance viewpoint in digital 
communications. Design approaches that can minimize the bit 
error rate (BER) have been reported in [18], [19], [20] and are 
termed adaptive minimum bit error rate (MBER) techniques. 
The work in ll20l appears to be the first approach to combine 
a reduced-rank algorithm with the BER criterion. However, 
the scheme is a hybrid between an EIG or an MWF approach, 
and a BER scheme in which only the reduced-rank filter is 
adjusted in an MBER fashion. 

In this paper, we propose adaptive reduced-rank techniques 
based on a novel JIO strategy that minimizes the BER cost 
function for very large multiuser MIMO systems. The pro- 
posed strategy adjusts the weights of both the rank-reduction 
matrix and the reduced-rank filter jointly in order to minimize 
the BER. We develop stochastic gradient (SG) algorithms for 
their adaptive implementation and present an automatic rank 
selection method with the BER as a metric. Simulation results 
for large multiuser MIMO systems show that the proposed 
algorithms significantly outperform existing schemes. 

The paper is structured as follows. Section |TT] briefly de- 
scribes the multiuser MIMO system model. The derivation 
of the MBER reduced-rank algorithm is described in section 
[TTT1 The complexity analysis of the proposed algorithm and 
the automatic rank selection scheme are introduced in section 
IIVI The simulation results are presented in section [V] Finally, 
section [VT1 draws the conclusions. 

II. System Model 

Let us consider the uplink of an uncoded synchronous 
multiuser MIMO system with K users and one base station 
(BS), where each user is equipped with a single antenna and 
the BS is with M uncorrected receive antennas, K < M. We 
assume that the channel is a MIMO time-varying flat fading 
channel. The M-dimensional received vector is given by 



r{i) = J2A k h k {i)b k (i) +n(i), 

k=l 



(1) 



where bk(i) € {±1} is the i-th symbol for user fc, and the 
amplitude of user fc is ^4^, fc = 1, . . . , K. The M x 1 vector 
hfc(i) is the channel vector of user fc, which is given by 



hfc(i) = [/ife,i(«) ■ ■ ■ h ktM {i)Y 



(2) 



whose elements hkj(i), f = 1,...,M, are independent 
and identically distributed complex Gaussian variables with 
zero mean and unit variance, n(i) = [ni{i) . . . nM{i)] T 
is the complex Gaussian noise vector with zero mean and 
E[n(i)n H (i)] = <r 2 I, where a 2 is the noise variance, (.) T and 
(.) H denote transpose and Hermitian transpose, respectively. 



III. Design of MBER Reduced-Rank Schemes 

In this section, we detail the design of reduced-rank schemes 
which minimize the BER. In a reduced-rank algorithm, an 
M x D projection matrix Srj is applied to the received data 
to extract the most important information of the processed data 
by performing dimensionality reduction, where 1 < D < M. 
A D x 1 projected received vector is obtained as follows 



r(i) = Sgr(i), 



(3) 



where it is the input to a D x 1 filter = [wi ,W2, ■ ■ ■ , wd] T - 
The filter output is given by 



z fe (i) = wfr(i)=wfSgr(i). 
The estimated symbol of user fc is given by 
6 fc (i) = sign{3*[wf r(t)]}, 



(4) 



(5) 



where the operator 3?[.] retains the real part of the argument 
and sign{.} is the signum function. The probability of error 
for user k is given by 



P e = P(x k < 0) = f f{Z k )dx k 

J — oo 

' sign{b k (i)}dt{x k (i)} 



Q 



p(wf SgS D w fe )2 



(6) 



where x k = sign{&fe(z)}3?[xfe(z)], f{x k ) is the single point 
kernel density estimate lfl8l which is given by 



f(xk) 



1 



-(x k -sign{b k (i)}$l{x k (i)}) 2 



x exp 



(7) 



where p is the radius parameter of the kernel density estimate, 
Q(.) is the Gaussian error function. The parameters of Sd 
and Wfc are designed to minimize the probability of error. By 
taking the gradient of © with respect to w£ and after further 



mathematical manipulations we obtain 



dP e 



exp 



q I sign{6fc (»)}5K[zfc(-t)] 
Vp(wf SgS D w fc )2 



2tt 

mix k (i)]\ 



dwt 



eX P { 2p^wf SgS D w 



sign{6 fc (i)} 



SR[x fc (z)]SgS c w fc 
(wfSgS D w fe )5 (wfSgS^Wfc)! 

(8) 

By taking the gradient of ((6} with respect to S* D and following 
the same approach we have 



dP e 

dS* 



exp 



d 



2p 2 w«SgS D w fc I ' \ p( ^H SgS D w fc 



sign{b fc (i)}5R[x fc (i)] 
T 
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sign{6 fe (?)} 
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Spwfcwf SR[z fc (i)] 



IV. Proposed MBER Adaptive Algorithms 



(9) 



In this section, we firstly describe the proposed scheme and 
MBER adaptive SG algorithms to adjust the weights of Sj^(i) 
and w(i) based on the minimization of the BER criterion. 
Then, a method for automatically selecting the rank of the 
algorithm using the BER criterion is presented. 



r(i) Projection 
Matrix 



r(i) Reduced-Rank 

»- Filter 

w t (i) 



»(0 



Ret. ) 











Adaptive 
JIO-MBER 
Algorithm 



4*(0 



1, 



Fig. 1 . Structure of the proposed reduced-rank scheme 

A. Adaptive Estimation of Projection Matrix and Receiver 

The proposed scheme is depicted in Fig. Q] the projection 
matrix Sd(«) and the reduced-rank filter vr k (i) are jointly 
optimized according to the BER criterion. The algorithm has 
been devised to start its operation in the training (TR) mode, 
and then to switch to the decision-directed (DD) mode. The 
proposed SG algorithm is obtained by substituting the gradient 
terms ([8) and (0 in the expressions \v k (i + 1) = Wfc(i) — 
A^H* and S D (i + l) = S D (i)- ns D §s^ [21] subject to the 
constraint of w^(i)Sg(«)S£)(i)wfe(«) = 1. Based on IfTSll . we 
can see that, with respect to the product Sjj\v k , there are only 
global minimum solutions, and all the solutions form a half 
hyperplane. In this work, we pick one of the MBER solutions 



for S^Wfe, which is with the unit length. At each time instant, 
the weights of the two quantities are updated in an alternating 
way by using the following equations 



exp 



w fe (i + 1) = w fe (i) + 



2p 2 



sign{6 fc (i)} 



2V2vrp 

x (Sg(i)r(i) - R[i fc (i)]Sg(i)S D (t)w fc (»)) 



exp 



S D (i + 1) = Sd(i) + MS £ 

x (r(i)wf (») - 



-\M[x k (i)]\ 
2p 2 



(10) 

sign{6 fc (i)} 



2V27T/5 

Sz3(i)w fc (i)wf(i)3i[5 fe (i)] 



(11) 



where /j, w and are the step-size values. Expressions 

( ITOb and dTTb need initial values, Wfc(0) and Sxj(O), and we 
scale the reduced-rank filter by wt = , =S at each 

iteration. The scaling has an equivalent performance to using a 
constrained optimization with Lagrange multipliers although it 
is computationally simpler. The proposed adaptive JIO-MBER 
algorithm is summarized in table U 

TABLE I 

Proposed adaptive JIO-MBER algorithms 

~ Initialize w fe (0) and S D (0). 

2 Set step-size values /i w and fis D 

3 for each time instant i = 0, 1, • • • do 

4 Compute w fc (i + 1) and S D (i + 1) using (To) and filY 

5 Scale the using wj = , Wfe 

6 Obtain w^(i + 1) and S£)(i + 1) for the next time instant. 



The joint optimization of and Sd has been shown to 
converge to the global minimum when the MSE is employed 
as the cost function lfT31 . The proposed scheme promotes an 
iterative exchange of information between the transformation 
matrix and the reduced-rank filter, which leads to improved 
convergence and tracking performance. However, when the 
BER is used as the cost function, there are local minima 
associated with the optimization. 

B. Computational Complexity of Algorithms 

We describe the computational complexity of the proposed 
JIO-MBER adaptive algorithm in multiuser MIMO systems. 
In Table [TTJ we compute the number of additions and mul- 
tiplications to compare the complexity of the proposed JIO- 
MBER algorithm with the conventional adaptive reduced-rank 
algorithms, the adaptive least mean squares (LMS) full-rank 
algorithm based on the MSE criterion lED and the SG full- 
rank algorithm based on the BER criterion |[T8l . Note that 
the MWF-MBER algorithm corresponds to the use of the 
procedure in ifTTI to construct So(i) and ( TTOb to compute 
Wfc(z). In particular, for a configuration with M = 32 and 
D = 6, the number of multiplications for the MWF-MBER 
and the proposed JIO-MBER algorithms are 7836 and 1225, 
respectively. The number of additions for them are 5517 and 
933, respectively. Compared to the MWF-MBER algorithm, 



the JIO-MBER algorithm reduces the computational complex- 
ity significantly. 



TABLE II 

Computational complexity of Algorithms. 



Algorithm 


Number of operations per symbol 


Multiplications 


Additions 


Full-Rank-LMS 


2M + 1 


'2M 


Full-Rank-MBER 


4M + 1 


AM - 1 


MWF-LMS CD 


DM 2 - M 2 


DM 2 - M 2 


+2DM + 4D + 1 


+3D - 2 


EIG (TO) 


0(M 3 ) 


0(M 3 ) 


JIO-LMStH) 


3DM + M 


2DM + M 




+3D + 6 


+4D - 2 


MWF-MBER HO 


(D+ 1)M 2 


(D - 1)M 2 




+{3D + 1)M + 3D 


+(2D- 1)M 




+M + 10 


+2D + M + 1 


JIO-MBER 


6MD + 5D 


5MD + D 




+M+11 


-M - 1 



C. Automatic Rank Selection 

The performance of reduced-rank algorithms depends on 
the rank D, which motivates automatic rank selection schemes 
to choose the best rank at each time instant IfTTI . lfT31 . IfTTI . 
Unlike prior methods for rank selection, we develop a rank 
adaptation algorithm based on the probability of error, which 
is given by 



P D (i) = Q 



sign{b k (i)}$t[x°(i)] 



P 



(12) 



where the receiver is subject to S^Scw^ = 1. For 
each time instant, we adapt a reduced-rank filter Wfe(i) and 
a projection matrix Srj(i) with the maximum allowed rank 
D mM , which can be expressed as 



w fc (i) = [wi(i), 



S D (t) 



(<),... 

Si,£> mi „M 

%,D min (i) 



S 1.D„ 



(13) 



(14) 



allowed for the reduced-rank filter, respectively. For each 
symbol, we test the value of rank D within the range, namely, 
Dmin < D < -D m ax- For each tested rank, we substitute the 
filter w fe (i) = [u>i(i), . . . ,uiD(i)] T and the matrix 



MO 



h,i(i) 



SM,l( i ) 



h,D(i) 



SM,D{i) 



(15) 



into ( fT2l to obtain the probability of error Po(i). The optimum 
rank can be selected as 



-D op t(«)=arg min Po(i). (16) 

DE{D min ,...,D max } 



The proposed MBER automatic rank selection requires the 
operation with Z? max to calculate 



M.32, K=17, SNR.15dB 



-D 



£>min 

+ £ sfr(i)v d ), 

d=-Dopt + l 



(17) 



where is a zero vector with a one in the dth position and 
s d = [si,d(i)> * * * ) SM,d(*)] T - A simple search over the values 
of x?(i) and the selection of the terms corresponding to D opt 
and f*D p t (i) are performed. 

V. SIMULATIONS 

In this section, we evaluate the performance of the proposed 
JIO-MBER reduced-rank algorithms and compare them with 
existing full-rank and reduced-rank algorithms. Monte-carlo 
simulations are conducted to verify the effectiveness of the 
JIO-MBER adaptive reduced-rank SG algorithms. The number 
of receive antennas at the BS is M = 32. The channel 
coefficient hk,f(i) is computed according to the Jakes model 
ll22l . We optimized the parameters of the JIO-MBER adaptive 
reduced-rank SG algorithms with step sizes fi w = 0.01 
and ns D = 0.025. The step sizes for LMS adaptive full 
rank, SG adaptive MBER full rank and the conventional 
adaptive reduced-rank techniques are 0.085, 0.05 and 0.035, 
respectively. The initial full rank and reduced-rank filters are 
all zero vectors. The initial projection matrix is given by 
S_d(0) = [Id, Odx(m-d)] T - The algorithms process 250 
symbols in TR and 1500 symbols in DD. We set p = 2a. 



M=32, K=7, SNR=15 dB 




— ©— MBER full-rank 

□ LMS full-rank 
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number of received symbols 

Fig. 2. BER performance versus the number of received symbols for the JIO- 
MBER reduced-rank algorithms and the conventional schemes. (D m j n = 3, 

Dmax =20, K = 7) 

Figs.|2]and[3]show the BER performance of the desired user 
versus the number of received symbols for the JIO-MBER 
adaptive SG algorithms and the conventional schemes. We 
set the rank D — 8 for the reduced-rank schemes, and the 
normalized Doppler frequency is fdT s — 1 x 10 -5 . We use 
15 dB for the input signal to noise ratio (SNR). From Fig. [2] 
and [3] we can see that the proposed JIO-MBER SG algorithm 




number of received symbols 



Fig. 3. BER performance versus the number of received symbols for the JIO- 
MBER reduced-rank algorithms and the conventional schemes. (D m i n = 3, 
D max =20, K = 17) 



with the automatic rank selection mechanism achieves the 
best performance. Although the full-rank MBER SG algorithm 
has a better performance compared to the proposed JIO- 
MBER SG algorithm with D = 8 for a system with a low 
load, the proposed JIO-MBER SG algorithm with D = 8 
outperforms the full-rank MBER SG algorithm for a highly- 
loaded system. We also can see that the JIO-MBER reduced- 
rank algorithms converge much faster than the conventional 
reduced-rank algorithms, and the MBER eigen-decomposition 
reduced-rank method with D = 8 does not work well in 
time-varying MIMO fading channels. For the group of JIO- 
MBER adaptive algorithms, the auto-rank selection algorithms 
outperform the fixed rank algorithms. 




O MBER full-rank 

□ LMS full-rank 

^ MWF-MBER, D: 

Q proposed MBER-JIO, D- 

^ proposed MBER-JIO auto 



SNR (dB) 

Fig. 4. BER performance versus SNR for the JIO-MBER reduced-rank 
algorithms and the conventional schemes. 1500 symbols are transmitted and 
250 symbols in TR. (D mi7l = 3, D max = 20, K = 10) 



Figs. |4] and [5] illustrate the BER performance of the desired 
user versus SNR and number of users K, where we set 
fdT s = 1 x 10~ 5 and D — 8. We can see that the best per- 
formance is achieved by the proposed JIO-MBER algorithm 



M=32, SNR=15dB 
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number of users (K) 

Fig. 5. BER performance versus number of users for the JIO-MBER 
reduced-rank algorithms and the conventional schemes. 1500 symbols are 
transmitted and 250 symbols in TR. (D m i n = 3, D max = 20, SNR = 15 
dB) 



with the automatic rank selection mechanism. The proposed 
JIO-MBER algorithm with D — 8 outperforms the MWF- 
MBER reduced-rank algorithm. For the low-SNR region and 
the high-load case, the proposed JIO-MBER algorithm with 
D = 8 outperforms the full-rank MBER SG algorithm. In 
particular, the JIO-MBER algorithm using the automatic rank 
selection mechanism can save up to over 5dB and support up 
to six more users in comparison with the full rank MBER SG 
algorithm, at the BER level of 6 x 10~ 3 . 

M=32, K=17, SNR=15dB 
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Fig. 6. BER performance versus the (cycles/symbol) for the JIO-MBER 
reduced-rank algorithms and the conventional schemes. 1500 symbols are 
transmitted and 250 symbols in TR. (D m i n = 3, D max = 20, SNR = 15 
dB, K =11) 



We show the BER performance of the analyzed algorithms 
as the fading rate of the channels vary. In this experiment, we 
set the number of users K = 17 and SNR = l5dB. In Fig. 
we can see that, as the fading rate increases the performance 
gets worse, and the proposed JIO-MBER algorithm with the 
automatic rank selection mechanism achieves the best perfor- 
mance, followed by the proposed JIO-MBER algorithm with 
D = 8, the full-rank MBER SG algorithm, the conventional 



MWF-MBER algorithm and the full-rank LMS algorithm. It 
shows the ability of the proposed JIO-MBER algorithms to 
deal with dynamic channels. 

VI. Conclusions 

In this paper, we have proposed a novel adaptive MBER 
reduced-rank scheme based on joint iterative optimization of 
filters for multiuser MIMO systems. We have developed SG- 
based algorithms for the adaptive estimation of the reduced- 
rank filter and the projection matrix, and proposed an auto- 
matic rank selection scheme using the BER as a criterion. The 
simulation results have shown that the proposed JIO-MBER 
adaptive reduced-rank algorithms significantly outperform the 
existing full-rank and reduced-rank algorithms at a low cost. 
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